Robot Weightlifting By Direct Policy Search
نویسندگان
چکیده
This paper describes a method for structuring a robot motor learning task. By designing a suitably parameterized policy, we show that a simple search algorithm, along with biologically motivated constraints, offers an effective means for motor skill acquisition. The framework makes use of the robot counterparts to several elements found in human motor learning: imitation, equilibrium-point control, motor programs, and synergies. We demonstrate that through learning, coordinated behavior emerges from initial, crude knowledge about a difficult robot weightlifting task.
منابع مشابه
Advancing student interaction with a learning robot: Digital twin, connectivity, and augmented reality
During interaction with learning robots, students often experience difficulties understanding the robot intent and its practical realization. To address this challenge, we propose a connected environment that integrates the robot, its digital twin and virtual sensors. The environment supports digital experiments that enable the physical robot to determine optimal policy for performing a manipul...
متن کاملReward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning
Direct policy search is a promising reinforcement learning framework, in particular for controlling continuous, high-dimensional systems. Policy search often requires a large number of samples for obtaining a stable policy update estimator, and this is prohibitive when the sampling cost is expensive. In this letter, we extend an expectation-maximization-based policy search method so that previo...
متن کاملEfficient Sample Reuse in EM-Based Policy Search
Direct policy search is a promising reinforcement learning framework in particular for controlling in continuous, high-dimensional systems such as anthropomorphic robots. Policy search often requires a large number of samples for obtaining a stable policy update estimator due to its high flexibility. However, this is prohibitive when the sampling cost is expensive. In this paper, we extend an E...
متن کاملWeightlifting Motion Planning for a Puma 762 Robot
In this paper we develop a point-to-point weightlifting motion planner for open-chained rvbota. The joint trajectm”es am defined b~ B-spline pol~omials along with a time-scale factor. Ph@al limitations of a Puma 762 robot a~ incorpomted into the formulation. The torque limits are formulated aa a Penaltg function (soft constminta) added into the objeetive function while the position and velocity...
متن کاملEvolutionary Policy Transfer and Search Methods for Boosting Behavior Quality: RoboCup Keep-Away Case Study
This study evaluates various evolutionary search methods to direct neural controller evolution in company with policy (behavior) transfer across increasingly complex collective robotic (RoboCup keep-away) tasks. Robot behaviors are first evolved in a source task and then transferred for further evolution to more complex target tasks. Evolutionary search methods tested include objective-based se...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001